Goto

Collaborating Authors

 vrsbench dataset



Bench LanguageBenchmark

Neural Information Processing Systems

Wefurther evaluated state-of-the-art models on this benchmark forthree vision-language tasks: image captioning, visual grounding, and visual question answering. Our work aims to significantly contribute to the development ofadvanced vision-language models inthefieldofremote sensing.